#Open Source

18 articles

TechMay 26, 202615 min

Hy-MT2 1.8B Q4_K_M on M1 Max 64GB: 1.25bit 440MB build does not load on stock llama.cpp yet

Hands-on with Tencent Hy-MT2 1.8B Q4_K_M (1.08GB) on M1 Max 64GB via llama-server. JSON, SRT, HTML, glossary, and minority-language prompts with full input-output pairs. The 1.25bit 440MB build does not load on stock llama.cpp 8990, and 30B-A3B (hy_v3) is not in the Mac route yet.

AI LLM Translation Local LLM Hugging Face Quantization MoE Open Source Mac Apple Silicon Experiment

TechMay 13, 202611 min

VoxCPM2 and OSS TTS in 2026: Irodori-TTS, F5-TTS, and Japanese fine-tune notes

VoxCPM2 sits in the tokenizer-free corner. Mapped vs F5-TTS, CosyVoice2, Irodori-TTS, Style-Bert-VITS2; plus why Japanese TTS still leans on OpenJTalk.

AI TTS Speech Synthesis Voice Cloning Local AI Open Source Fine-tuning

TechApr 29, 202610 min

Why 74HC595 Seven-Segment Displays Flicker and How QUAD7SHIFT Handles the Latch Boundary

Flicker and ghosting on 74HC595-based seven-segment displays often come from latch boundary placement rather than power issues. A look at how QUAD7SHIFT's 16-bit atomic update avoids the problem.

Hardware Open Source Arduino Microcontroller Electronics

TechApr 24, 20269 min

Japan's Digital Agency open-sources its government AI "Gennai" with RAG, self-hosted LLM, and legal-AI templates under commercial-friendly licenses

Japan's Digital Agency released parts of Gennai, the generative AI platform it runs for central-government staff, on GitHub under MIT / CC BY 4.0. The web app and cloud-specific AI templates for AWS, Azure, and Google Cloud are bundled together so local governments and private companies can redeploy the same stack.

AI LLM RAG Open Source National strategy AWS Azure Google Cloud

TechApr 2, 202624 min

Cloudflare's serverless CMS EmDash is now in beta as a WordPress successor

A full-stack serverless CMS built on Astro 6.0, EmDash tries to solve WordPress's long-running plugin security problem with V8-isolate plugin sandboxing.

Cloud WordPress Astro Security Open Source

TechApr 1, 20269 min

TRL v1.0 is a major release that gives LLM post-training a stable foundation

Hugging Face's LLM post-training library TRL has reached v1.0. Stable/Experimental tiers, the stabilization of GRPO/DPO/SFT, and a roadmap that includes asynchronous GRPO all point to a more mature stack.

AI Machine Learning Reinforcement Learning LLM Open Source

TechMar 27, 20266 min

HyperAgents shows that improving the way you improve can transfer beyond coding

Meta AI's HyperAgents performs metacognitive self-correction that optimizes improvement strategies themselves. Self-improvement appears in four non-coding domains, and strategies learned in one domain transfer to another, along with spontaneously acquired persistent memory.

MachineLearning AI AI Agents Open Source Research

TechFeb 12, 20267 min

MioTTS - a lightweight LLM-based TTS built from a custom codec

MioTTS from Aratako is a family of 0.1B to 2.6B Japanese-English TTS models built from scratch around the custom MioCodec. Its key feature is that it runs directly in llama.cpp and Ollama.

AI TTS Speech Synthesis Open Source LLM

TechFeb 7, 20266 min

Qwen3-TTS — Open-source speech synthesis with a single pip install

A technical overview of Qwen3‑TTS from Alibaba’s Qwen team: one‑line pip install, 3‑second voice cloning, natural‑language voice design, and support for 10 languages including Japanese. Apache 2.0 licensed.

AI TTS Speech Synthesis Open Source LLM

TechFeb 6, 20266 min

Qwen3-Omni: An omni-modal MoE that unifies text, image, speech, and video with 3B active parameters

A technical walkthrough of Alibaba's Qwen3-Omni-30B-A3B. An omni-modal model that activates only 3B out of 30B and responds with speech from text/image/audio/video inputs. The article organizes the Thinker–Talker architecture, benchmarks, and the overall Qwen3 MoE family.

AI LLM Open Source Multimodal Voice AI

TechFeb 5, 20264 min

UI-TARS-1.5-7B: a vision AI agent that reached SOTA in GUI grounding

A technical look at ByteDance's UI-TARS-1.5-7B, which beats OpenAI CUA and Claude 3.7 by a wide margin at identifying GUI elements from screenshots, and can run locally with a desktop app.

AI LLM Agent Open Source

TechFeb 4, 20265 min

Qwen3-Coder-Next: A Local Coding Agent with 3B Active Parameters

Technical overview of Alibaba’s Qwen3-Coder-Next. An ultra-efficient MoE with 80B parameters but only 3B activated, runs even on a single RTX 4090. Brings 70%+ SWE-Bench performance to local use.

AI LLM Open Source Agent